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Abstract 

A two-rowed array a„ = ( ?^ ?^ ' ' ' ?" ) is said to be in lexicographic order if a^. < au+i 

\0i 02 • • • On/ 

and 6fe < 6^+1 if a^. = flfc+i- A length I (strictly) increasing subsequence of a„ is a set of indices 
ii < i2 < ■ ■ ■ < it such that hi^ < hi^ < . . . < bi^. We are interested in the statistics of the 
length of the longest increasing subsequence of q;„ chosen according to 2?„, for distinct families of 
distributions V = (I',i)„eN, and when n goes to infinity. This general framework encompasses well 
studied problems such as the so called Longest Increasing Subsequence problem, the Longest Common 
Subsequence problem, problems concerning directed bond percolation models, among others. We define 
several natural families of distinct distributions and characterize the asymptotic behavior of the length of 
a longest increasing subsequence chosen according to them. In particular, we consider generalizations 
to d-rowed arrays as well as symmetry restricted two-rowed arrays. 



1 Introduction 

Suppose that we select uniformly at random a permutation vr of [n] =^ {1, . . . , n}. We can associate to 

/ 1 2 ... n \ 

TT the two-rowed lexicographically sorted array a-j^ = { / x ■ We denote by lisii:) 

V7r(l) 7r(2) ... ^(n)y 

the length of a longest increasing subsequence of a.,^. The determination, as n — )• 00, of the first moments 
of lis{7r) has been a problem of much interest for a long time (for surveys see ||AD99[ IOR98[ ISta02]| and 
references therein). This Une of research led to what is considered a major breakthrough: the determination 
by Baik, Deift and Johansson [BDJ991 of, after proper scaling, the distribution of lis{-). In BBROll . varia- 
tions are studied where instead of permutations of [n], random involutions, signed permutations, and signed 
involutions are selected at random. Generalizations where d — 1 random permutations are selected can be 
restated as problems concerning longest increasing subsequences of d-rowed arrays. 

Suppose now that we select uniformly at random two words fi and v from S", where S is some finite 
alphabet of size k. We can associate to (/i, u) the two-rowed lexicographically sorted array where 

is a column of a^,i/ if and only if the i-th character of fi is the same as the j-th character of u (for an 

example, see Figure[T]l. The length of a longest common subsequence of /i and u, denoted lcs{fi, u), equals 
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Figure 1 : Lexicographically ordered two-rowed array associated to words abaca and bbaca 



the length of a longest increasing subsequence of a^^jy. Since the mid 70's, it has been known IICS75II that the 
expectation of lcs{fi, u) when normalized by n, converges to a constant 7^ (the so called Chvatal-Sankoff 
constant). The determination of the exact value of 7^, for k fixed, remains a challenging open problem. To 
the best of our knowledge, the asymptotic distribution theory of the longest increasing subsequence problem 
is essentially uncharted territory. Generalizations where d random length n words are chosen from a finite 
alphabet S can also be restated as problems concerning longest increasing subsequences of d-rowed arrays. 

We now discuss yet one more relevant instance, previously considered by Seppalainen |Sep97p , and 
encompassed by the framework described above. Fix a parameter < p < 1 and let n be a positive integer. 
For each site of the lattice [n]^, let a point be present (the site is occupied) with probability p and absent 
(the site is empty) with probability q = I — p, independently of all the other sites. Let uj : [n]^ — )• {0, 1} 
be an encoding of the occupied/empty sites (1 representing an occupied site and a vacant one). We can 

associate to uj a two-rowed lexicographically sorted array auj where is a column of a^i if and only 

if site G [n]^ is occupied. Let L{uj) equal the number of sites on a longest strictly increasing path 
of occupied sites according to co, where a path (xi, yi), (x2, 1/2), ■ ■ ■ (x-m, Um) of points on [n]^ is strictly 
increasing if xi < X2 < • • • < Xm and yi < y2 < • • • < Um- Observe that L{uj) equals the length of 
a longest increasing subsequence of a^j. Subadditivity arguments easily imply that the expected value of 
L{ijj), when normalized by n, converges to a constant 7p. Via a reformulation of the problem as one of 
interacting particle systems, Seppalainen | |Sep97| shows that 7p = 2^/(1 + ^/p). Also worth noting is 
that the same object aui arises in the study of the asymptotic shape of a directed bond percolation model 
(see | Sep97[ §1] for details). Symmetric variants, where for example site (i, j) is occupied if and only if 



(j, i) is occupied, can be easily formulated. Generalizations where d-dimensional lattices are considered 
can also be restated as problems concerning longest increasing subsequences of d-rowed arrays. However, 
to the best of our knowledge, neither of the latter two variants has been considered in the literature. 

Thus far, we have described well studied scenarios where the general problem formulated in the abstract 
naturally arises. This motivates our work. However, for the sake of clarity of exposition and in order to 
use more convenient notation, it will be preferable to reformulate the issues we are interested in as one 
concerning hyper-graphs. To carry out this reformulation, below we introduce some useful terminology and 
then address in this language the problem of determining the statistics of the length of a longest increasing 
subsequence of a randomly chosen lexicographically sorted d-rowed array. 

Let Ai, . . . , A^he. d disjoint (finite) sets, also called color classes. We assume that over each Ai there 
is a total order relation, which abusing notation, we denote < in all cases. When we consider subsets of 
a totally ordered color class we always assume the subset inherits, and thus respects, the original order. 
A d-partite hyper-graph over totally ordered color classes Ai, . . . ,Aci with edge ?>et E Q Ai x . . . x Ad 
is a tuple G = {Ai, . . . , Ad] E), and its edge set is denoted by E{G). For A[ C Ai with 1 < i < d 
and hyper-graph G = {Ai, . . . , Ad] E), we denote by GL/ x xA' hyper-subgraph induced by G in 

1 "■ d 

A'^x . . .X A'^, i.e. the hyper-graph with node set V' = A[ x . . . x A'^ and edge set E n V. We say that two 
hyper-graphs are disjoint if their corresponding vertex sets are disjoint. Let KAi,...,Aa denote the complete 
d-partite hyper-graph over color classes Ai, . . . ,Ad whose edge set is Ai x ^2 x • • • ^d- Henceforth, we 
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denote the cardinality of Ai by Wj. If we identify Ai with [rii], then we write Kn^^,__^n^ instead of -f^Ai,...,Ad- 
If ni = . . . = Ud, then we write K^i^"^ instead of i^ni,...,nd- Over the edge set of KAi,...,Ad we consider the 
natural partial order relation ^ defined by 



We say that a collection of node-disjoint edges M C E{G) is a non-crossing hyper-matching if for every 
pair of edges e, / G M it holds that e ^ / or / ^ e. When G = {Ai, . . . , Ad; E) is such that E{G) 
is a non-crossing hyper-matching we will simply say that G is a non-crossing d-partite hyper-graph, or 
simply a non-crossing hyper-matching. Furthermore, we will denote by L{G) the size of a largest non- 
crossing hyper-matching of G and by L{T) the random variable L{G) when G is chosen according to a 
distribution T over d-partite hyper-graphs. When we want to stress that we are dealing with only two color 
classes, we will speak of graphs and matchings instead of hyper-graphs and hyper-matchings. 

Now, consider a family of distributions T> = {'D{KAi,...,Aa)) where each 'D{KAi,...,Aa) is a probability 
distribution over subgraphs of ifAi,...,Ad- In this work we are interested in understanding what we refer to as 
the Longest Non-crossing Matching problem, i.e. the behavior of the expectation of L{G) when G is chosen 
according to various distinct families of distributions V = (V{Kn^)) and n goes to infinity. Of course, 
in order to be able to derive some meaningful results we will need some assumptions on the distributions 
V{K!a^). Below, we encompass in a definition a minimal set of assumptions that are both easy to establish 
and general enough to capture several relevant scenarios. 

Definition 1 LetT> = {V{KA'i^,...,Ad)) be afamily of distributions where each 'D{KAi^,...,Aa) ^ probability 
distribution over the collection of hyper-subgraphs ofKAi,...,Ad- We say that T> is a random d-partite hyper- 
graph model if for H chosen according to 'D(^KA-i^,...,Ad) the following two conditions hold: 

1. Monotonicity: If A'^^ C Ai with 1 < i < d and n[ = \Ai\, then the distribution of H\^, ^ is 



2. Block independence: IfA'-,A'f C A^ are disjoint with 1 < i < d, then H' = H\j^,^^ and 
H" = H\^„ ^„ are independent (and so, L{H') and L{H") are also independent). 

1 ■■■ d 

For some of the results we will estabUsh, the following weaker notion will suffice. 

Definition 2 Let V = {I){K^'')) be afamily of distributions where each 'D{Kn'') is a probability distribu- 
tion over the collection of hyper-subgraphs ofK^\ We say that V is a weak random d-partite hyper-graph 
model if for H chosen according to 'D{K^^) the following two conditions hold: 

1. Weak monotonicity: If A' C [n], \ A'\ = n', then the distribution of H\^,^ isV{K^^). 

2. Weak block independence: If A', A" C [n] are disjoint with I < i < d, then H' = H\j^,^ and 
H" = H\^„y. y.A" <^f^ independent (and so, L{H') and L{H") are also independent). 

The reader may easily verify that the following distributions (on which we will focus attention) give rise to 
random d-partite hyper-graph models: 

• S(Kni,...,nd) ^) (the random d-word model) — the distribution over the set of hyper-subgraphs ob- 
tained from Kn-i,...,na when each element in the vertex set of i^ni,...,nd is uniformly and independently 
randomly assigned one of k letters and where edges, for which not all of its nodes end up being as- 
signed the same letter, are discarded. 




■0- 
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Figure 2: A bipartite graph in the support of S{Ki2^i2,p)- 




Figure 3: A bipartite graph in the support of A{Ki2,i2,p)- 

• G{Kni,...,naTP) (the d-dimensional binomial random hyper-graph model) — the distribution over the 
set of hyper-subgraphs H of Kn^,...,^^ where the events {H\e ^ E{H) } for e G E{Kn^^,,,^nci) have 
probability p and are mutually independent. 

The model Ti{Klf^ , A;) is refen^ed to as the random word model because it arises when one considers the let- 
ters of d words uji, . . . ^ujd of length ni, . . . , n^, respectively. The letters in each word are chosen uniformly 
and independently from a finite alphabet of size k. Then, each word is identified with a color class of a hyper- 
subgraph H of Kni,...,nd whose hyper-edges are the {vi, . . . , v^) G V{H) for which vi, . . . ,Vd have been 
assigned the same letter. It is easy to see that the longest common subsequence of cji , . . . , equals £ if and 
only if L{H) = £. The random word model thus encompasses the Longest Common Subsequence problem 
discussed above. Similarly, the attentive reader probably already noticed that the binomial random graph 
model also encompasses the already discussed point lattice process considered by Seppalainen | |Sep97 ]. 

Inspired in the work of Baik and Rains BBROll cited above, where symmetric variants of the Longest 
Increasing Subsequence problem were considered, we will also study the following two symmetric variants 
of the binomial random graph model: 

• S{Kn,n,p) (the symmetric binomial random graph model) — the distribution over the set of subgraphs 
H of Kn.n where the events {H \ {i, j), (j, i) G E{H) } for 1 < i < j < n, have probability p and 
are mutually independent. 

• A{K2n,2n,p) (the anti-symmetric binomial random graph model) — the distribution over the set 
of subgraphs H of i^2n,2n where the events { H \ (2n — i + 1, 2n — j + 1) G E{H) } for 
1 < ^, i < 2n have probability p and are mutually independent. 

Note that {S{Kn,n,p))ne'N is not a random model according to Definition[T| but it is a weak random model 
according to Definition[2] On the other hand, {A{K2n,2n, p))neN is not even a weak random model. 

Henceforth, given a random bipartite graph model V = {!){■)), any value that is constant across the 
distributions D(-) will be called internal parameter of the model — e.g. 1/p and k in Q{-,p) and A;), 
respectively. 

The main purpose of this work is to establish a general result, referred to as Main Theorem, with a 
minimal set of easily verifiable hypothesis, that characterizes the limit behavior, when properly normalized. 
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of E L{T>{Krf^ , p)) when d is fixed and botli n and the internal parameter t go to infinity. We also show 
several applications of our Main Theorem. Specifically, we characterize aspects of the limiting behavior for 
the four previously introduced random hyper-graphs models. In the following section we formally state our 
Main Theorem and the results of its application. 

1.1 Main contributions 

A straightforward application of Talagrand's inequality (as stated in IIJLROOl Theorem 2.29]) yields that 
both L{Ti{Kn^n, k)) and L{Q{Kn,n,p)) are concentrated around any one of their (potentially not unique) 
medians. As we shall see, the same is true for L{Tj{K^\k)) and L{Q{Kn\p)). Somewhat equivalent 
statements hold for the the symmetric and anti-symmetric binomial random graph models. The following 
general notion will encompass the concentration type requirement the random hyper-graph models will need 
to satisfy in order for our Main Theorem to be applicable. 

Definition 3 Let T be a distribution over bipartite hyper-graphs and Med be a median of L{T). We say 
that F has concentration constant h if for all s > 0, 



Pr[L(J^) < (l-s)Med] < 2exp (-/is^Med) , 

/ 

Pr[L(J') > (l + s)Med] < 2exp -/i- -Med 

V + 

We say that the random bipartite hyper-graph model D = {T>{-)) has concentration constant h if each D(-) 
has concentration constant h. 

Note that if one can estimate a median of L{T) for some distribution T, show that the median and mean 
are close, and establish that T has a concentration constant, then one can derive a concentration (around 
its mean) result for L{F). Unfortunately, it is not in general easy to estimate a median of L(P(JC„j^... 
for the distributions ... we consider. However, we will be able to approximate them under some 

assumptions on ni, . . . , n^. In particular, we will show that there is a median that is proportional to the 
geometric mean of ni, . . . , n^^. The following definition captures the aforementioned assumptions we will 
need, and the sort of approximation guarantee that we will be able to establish. 

Definition 4 Led D = {'D{Kni,...,nj_)) be a random d-partite hyper-graph model with internal parameter t. 

Fix ni, ... ,12(1 and let N = ^HiLi ^'^'^ ^ ~ X]f=i denote the geometric mean and sum of 

Til, ... , Ud, respectively. We say that T) admits a (c, A, 0) -approximate median (or simply a (c, A, ^)-medianj 
if for all 5 > there are sufficiently large constants a{6), b{5), and t'{6), such that for all t > t',for which 

• Size lower bound condition: N > at-^, 

• Size upper bound condition: Sh < t^, 

it holds that 

(1-5)^ < Med < (1 + ^)^. 

In other words, if V = (^(i^n^ ... is a a random d-partite hyper-graph model with internal parame- 
ter t that admits a (c, A, 0)-median and the geometric mean (respectively sum) of ni, . . . , is (respec- 
tively S) are such that N = Q,{t-^) (respectively S = 0{t^)), then for sufficiently large t, every median of 
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L(P(ni, . . . , rirf)) will be close to cNt"-^. Although the above defined approximate median notion might 
at first glance sound artificial, we will see that it is possible to obtain such type of approximations for the 
random hyper-graph models we are interested on. 

Returning to our discussion, the relevance of the notion of approximate median is, when the random 
hyper-graph model admits a concentration constant, that it allows us to derive concentration bounds around 
an approximation of the median which in turn will be closed to the mean. Endowed with such estimates of 
the mean, we can easily derive the thought after limiting behavior of such expected values. This in essence, 
is the crux of our approach to attacking all variants of the Largest Non-crossing Matching problem. 

Unfortunately, the approximation of Med . . . , Ud))] guaranteed by the existence of a (c, A, 6)- 

median, as in Definition Ul holds for the rather restrictive condition &X]f=i f^i ^ ■ However, the mono- 
tonicity and block independence properties of random hyper-graph models allow us to relax the restriction 
and still obtain essentially the same conclusion. More precisely, it will be possible to obtain the same guar- 
antee, but requiring only that the sum of the n^'s is not too large in comparison with the geometric mean 
of the nj's. Moreover, and of crucial importance, under the same conditions one can show that the median 
and mean of . . . , nd)) are close to each other. The following result, which is the main result of this 

work, precisely states the claims made in the preceding informal discussion. 

Theorem 5 [Main Theorem] Let D = [^[Kn^^^^^^na)) ^ random hyper-graph model with internal pa- 
rameter t and concentration constant h which admits a (c, A, 6)-median. Fix ni, . . . , and let N and S 
denote the geometric mean and sum ofrii,..., Ud, respectively. Let < rj < min {X/{d — 1),9 — X} and 
g = 0{n 

For all e > there exists to and A sufficiently large such that if t > to is such that N > t^A (size 
constraint) and S < g{t)N (balance condition), then 



(1-6)^ < E[L(P(K„„...,„J)] < + 



(1) 



and the following hold: 

• If Med is a median of Med [L(V{Kn-i,...,nd))]> then 



(1-6) 



cN 



< Med < (1 + 6 



cN 



(2) 



There is a constant K > such that 



Pr 
Pr 



LiViKn„...,n,)) < (1 - e 



LiViKn,,...,n,))>{l + e: 



cN 
cN 



< exp 

< exp 



-Khe' 



-Kh 



,cN 
6^ cN 



(3) 
(4) 



Moreover, if ni = . . . = Ud = n and T> = {T>{Kn )) is just a weak random hyper-graph model, then the 
the lower bounds in ([T]) and Q, and inequality Q, still hold. 

As a consequence of the previously stated Main Theorem, with some additional work, we can derive 
several results concerning the asymptotic behavior of the expected length of a largest non-crossing matching 
for all of the random models introduced above. Our first two applications of the Main Theorem concern the 
random binomial hyper-graph model {Q{K^\p))ne'N and the random word model {Ti{K^\ k))ne-N- The 
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asymptotic behavior of the length of a largest non-crossing hyper-matching for both of these models is 
(interestingly!) related to a constant q that arises in the work of Bollobas and Winkler [BW881 concerning 
the height of a largest chain among random points independently chosen in the d-dimensional unit cube 
[0, l]*^. Specifically, for the random binomial hyper-graph model, we show: 



Theorem 6 For < p < 1, there exists a constant 6p such that 



1 

iim — E 

n— >oo n 



inf -E 

TieN n 



and 5p/ ^ — )• when p — 0. 



For the case where the underlying model is the one that arises when interested in the length of a longest 
common subsequence of d randomly chosen words over a finite alphabet, i.e. the random d-word model, we 
establish: 



Theorem 7 For A; G N, there exists a constant 7^ such that 



lim — E 

n— >oo n 



inf -E 

neN n 



Ik , 



and /fci-Vd 



7fc 



Cd when A; — )■ 00. 



The d = 2 case of Theorems |6] and |7] were already established by Kiwi, Loebl, Matousek ??. This work 
generalizes and strengthens the arguments developed in ??, as well as elicits new connections with other 
previously studied problems (most notably in IIBW88I ). 

Finally, we consider the symmetric versions of random graph models introduced above and show how the 
Main Theorem, plus some additional observations, allows one to characterize some aspects of the asymptotic 
behavior of the length of a longest non-crossing matching. Specifically, we prove the following two results. 



Theorem 8 For < p < 1, there exists a constant ap such that 

\\m -^[L{S{Kn,n,p))] = inf -E[L(cS(i^„,„,p))] = dp, 
n-s>oo n jieN n 

and OpI yjp — )• 2 when p — )• 0. 

Theorem 9 For < p < 1, there exists a constant Up such that 
n^oo 2n neN 2n 



and Op/ — )• 2 when p — )• 0. 



7 



1.2 Preliminaries 



For future reference we determine below concentration constants for the binomial and word models. 

Proposition 10 The d-dimensional binomial random hyper-graph model admits a concentration constant 
of 1/4. The random d-word model admits a concentration constant ofl/ {Ad). 

Proof: Let H be chosen according to Q{Km^^, _^nj:,p)- Since L{H) depends exclusively on whether or 
not an edge appears in H (and by independence among these events), it follows that L{H) is 1-Lipschitz, 
i.e. \L{H) — L{HA {e})| < 1. Moreover, if L{H) > r, then there is a set of r edges that are a witness for 
the fact that L{H) > r, for every H containing such a set of r edges. A direct application of Talagrand's 
inequality (as stated in IIJLROOl Theorem 2.29]) proves the claim about the concentration constant for the 
d-dimensional binomial random hyper-graph model. The case of the random d-word model is similar and 
left to the reader to verify. ■ 

1.3 Organization: 

For the sake of clarity of exposition and given that the arguments employed are different, we prove in 
separate sections the lower and upper bounds (as well as lower and upper tail bounds) of the Main Theorem's 
statement. Specifically, in Section |2] we establish all the lower bounds and lower tail bounds claimed in the 
Main Theorem. In Section[3j we prove the upper bounds and upper tail bounds stated in the Main Theorem, 
thence completing its proof. Finally, in Section]?] we apply the Main Theorem to four distinct scenarios. 
Specifically, we consider the cases where the underlying random model is the binomial random hyper-graph 
model, the random word model, the symmetric binomial random graph model, and the anti- symmetric 
binomial random graph model. 

2 Lower bounds 

In this section we will establish the lower bounds claimed in the statement of the Main Theorem, i.e. the 
lower bounds in ([T]) and Q, and inequality Q. 

Let V, c. A, 9, 7], and e be as in the statement of the Main Theorem. Let 5 > Obe sufficiently small so 

(1 - 6f{l -26) > 1 - I , (Definition of 6) 

and let a = a{6), b = b{6) and t' = t'{6) as guaranteed by the definition of (c, A, 0)-median. 

Since g = 0{t^), there are constants Cg > 1 and tg >0 such that g{t) < Cgt^ for all t > tg. Choose A 
large enough so 

Choose to > max {tg,t' {6)} sufficiently large so that for all t > to, 

g{t) < Cgti and CgbAti < t^-^ . (6) 

Now, assume t > to and that the geometric mean N and sum of ni , . . . , satisfy the size and balance 
conditions. Thus, the size constraint and balance condition guarantee that 

N > At^ and S < g{t)N < CgNf^ . (7) 
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Figure 4: Illustration, for (i = 3 and q = 4,of the construction of blocks Hi, . . . , Hq. 

Finally, assume H is chosen according to V{Kni,...,nj_)- 

If the rij's satisfy the size conditions of the definition of a (c, A, 0) -median and since the model admits a 
concentration constant, then we would have a concentration bound around cNt-'^ for L{H). Unfortunately, 
when some of the rij's are large, then S will be large, and the size upper bound condition need not be 
satisfied, leaving us without the desired concentration bound. To overcome this situation, we break apart H 
into hyper-subgraphs Hi, . . . , Hg of roughly the same size which we will refer to as blocks. The blocks will 
be vertex disjoint, the proportion between the sizes of the color classes in each Hi will be roughly the same 
than the one in H. However, the crucial new aspect is that the size upper bound condition will be satisfied 
in each block Hi allowing us to derive a concentration bound for L{Hi). This will later allow us to obtain a 
concentration bound for L{H), details follow. 

Let q = \N/{At^)'\. For each 1 < i < d, let n'- = \nj/q\. Henceforth, let N' and S' denote the 
geometric mean and the sum of the n^ 's. Denote the j-th color class of H by Aj. Recall that Aj is totally 
ordered. Let Aj^i be the first n'- elements of Aj, Aj^2 be the following n'j elements of Aj, so on and so 
forth up to defining Aj^q. Clearly, the Aj/s are disjoint, but do not necessarily cover all of Aj. Now, for 
1 < i < q, define Hi as the hyper-subgraph induced by H in Ai^i x . . . x j (for an illustration, see 
Figure |4]). Observe that the proportion between the sizes of the color classes of Hi is roughly the same as 
the one among the color classes of H. 

Note that by monotonicity, the distribution of Hi is ^(Kn/^ Moreover, since the Hi's are dis- 

joint, by block independence, their distributions. It follows that L{Hi), . . . , L{Hq) are independent random 
variables. A crucial, although trivial, observation is that 

L[H) > ^L{H,). (8) 

i=l 

On the other hand, by definition of q and the size constraint condition, 

N N 2N . r 

At^ - - At>^ ^ - At^ ■ (Estmiateofg) 

In order to estimate the geometric mean A^' of n'^, . . . , n'^, the following result will be useful. 

Lemma 11 If xi, . . . , Xd are positive real numbers, then 

d '^^^ 

11(^^ -1) ^ n 
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Proof: By induction on d. ■ 

It follows, by the preceding lemma, the estimate of q, and the balance condition, that 



N 



d 

n 



l/d 



> N' > 





( d 




> 


n-- 












d-l\ ^/d 



By Q, our estimate of q, and since r]{d — 1) < A, 

5^ < ^r<d-i.ri{d~iyx 

Given the way we have chosen A, we have that (1 - qS'^-^ /N'^y/'^ > 1 - 5 and thus 

^ >A.'> ^(l-i). 

q q 



(Estimate of N') 



Based on the preceding estimate of N' and the estimate for q we will now show that n'l, . . . , n'^ satisfy the 
size conditions required by the definition of (c, A, 0)-median. Indeed, by our estimate of N' and q, and (|5]) 

N' > —{1-S) > ^At^il-d) > at^. 

q ^ 

Moreover, by definition of S", our estimate of q, (|7]l, and (|6]l, 

S'h < — < < C.hAt^^'^ < 

- q - N ~ ^ 

Now, let H' be chosen according to ^^(i^ri'^,...,n^) and let Med' be a median of L{V{Kni^ yJ). By defini- 
tion of (c, A, 6')-median, we get that cN't~^{l - 5) < Med' < cN't^^{l + 6). Moreover, by definition of 
constant of concentration and approximate median, applying Markov's inequality yields, 

cN' 



E[L{H') 



cN' 

> (l-25)^Pr 

cN' 

> (l-2d)^Pr 



L{H') > (1-2(5)- 
L{H') > (l 



1 



Med' 



> (1 - 26) 



> (1 - 26) 



cN' 
cN' 



1 — 2 exp ( —h 



1 — 2 exp I —h 



(1-5)2 

6^ cN' 



Med' 



As observed above, iV' > At^{l-6)/2, so by choice of A we get that E [L(i?') ] > {l-26){l-5)cN't 
Hence, given that L{H) > Yli=i L{Hi), the estimate of N', the definition of 6, and elementary algebra, 

^ ^/Y' rN rN 

E[L{H)] > Y,E[Lm] > (1 - 26){1 - 6)q-^ > (1 - 25)(1 - 5)^^ > (l-e/2)^. 
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We have thus established the lower bound claimed in ([T]l. 
Now, we proceed to show ([3]). Note that 



Pr 



LiH) < (1 - e) 



cN 



< 



Pr[L{Hi) = Si,i = l,...,q] . (9) 

(si,...,sg)eN9 

si+...+Sq<(l-e)cNt-^ 



Let The the set of indices of the summation in the preceding displayed equation. Also, for T = (si, . . . , Sq) 
belonging to T let Pt denote Pr [L{Hi) = Si,i = 1, . . . , q]. We will show that Pt is exponentially small 
with respect to cA''t~^. Recalling that the L{Hi)''s, are independent and distributed as L{H') when H' is 
chosen according to „^), 



Pt = llPr[L{Hi) = Si] < {Pr [L{H') < s^]y . 



i=l 



Again, by the way in which H' is chosen, the definition of Med', and the definition of (c, A, 0) -median, for 
all i such that Si < {1 - 6)cN't-^ < Med' < (1 + 6)cN't-^ < 2cN't~^, it holds that 



Pr[L{H')<Si] = Pr 



- Med' 



L{H') < ^1 

< 2 exp 



Med' - s 
Med' 

ht^ 



Med' 



-i{l-6)cN't-^-s,) 



Hence, for all 1 < i < g, 

Pr[L{H')<Si] < 2 exp 

and then 



'2cN' 



max 



{0,((l-5)ciV't-^-5,)2} 



-IiiPt > - Y,\^Px{L{R')<Si\ > -(?ln(2) + — ^(max|0,(l-<5)cAr't-^-Si|j . 

i=l ^ i=l 

By Cauchy-Schwartz's inequality, our estimate of N', the fact that si + . . . + < (1 — e)cNt^-^, and since 
by definition of S we know that (1 — 5)^ > 1 — e/2. 



, (max {0,(1- (5)ciV't-^-s,})^ > ^ max |o, (1 - 5)ciV't-^ - s^} 

\ t=l i=l 



> (1 - (5)ciV'gt"^ - ^ Si > {l-6fcNt-^-{l-e)cNt-^ > 



i=l 



cNe 



Combining the last two displayed inequalities and recalling our estimate of A^', we get 

, , , ht^ c^N^e^ , , , hcNe'^ 

-IuPt > -gln(2) + ;^-^^ > -gln(2) + ^^. 



2cN'q 4*2 
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By Q and using tlie standard estimate (^) < {ea/h)^, we have 



Pr 



L{H) < (1 - e) 



cN 



< Pt < ITI-maxPT 



< ' • maxPT < exp gin 2e[l + (1 - e)cm"^/g] ^ 



Now, by g's estimate we know that N < qAt^ < 2N. Thus, if we require that A is large enough so that 

ln(2e[l + (1 - e)cA]) < Ahce^ /32, we get that 



Pr 



cN 

L{H)<{l-e)^ 



< exp — 3- ln(2e[l + (1 - e)cA]) - < exp ' 



This proves the lower bound claimed in Q. 

What remains is to show the lower bound in ([2]). By g's estimate we have N > At^ which together with 
our choice of A (see ([5])), imply that 

he'^cN\ f he'^c \ 1 



Combining the last two displayed equations, it follows that Pr [L{H) < (1 — e)cNt^'^~\ < 1/2, implying 
that any median of L{H) must be at least (1 — e)cNt^-^. 

Remark 12 The reader may check that all claims proved in this section still hold if instead of T) = 
(T>{Kn-^^^^^^na)) had worked with a weak random hyper-graph model V = (T>{K^^)). Indeed, if this 
would have been the case, then for H chosen according to T)(Kif^), the hyper-graphs Hi, . . . , Hq obtained 
above from H would have all their color classes of equal size, and the weak random hyper-graph model 
assumption is all that is all that is need to carry forth the arguments laid out in this section. 

3 Upper bounds 

In this section we will establish the upper bounds claimed in the statement of the Main Theorem, i.e. the 
upper bounds in ([T]l and ([2]), and inequality Q. The proof of the latter of these bounds, the upper tail bound. 



is rather long. For sake of clarity of exposition, we have divided its proof in three parts. First, in Section 3.1 



we introduce some useful variables. In Section 3.2 we establish (|4]) for not to large values of the geometric 



mean N. Then, in Section 3.3 we consider the case where N is large. Finally, in Section 3.4 we conclude 



the proof of the bounds claimed in the Main Theorem. 
3.1 Basic variable definitions 

For the rest of this section, let V, c. A, 9, t], and e be as in the statement of the Main Theorem. Define 

r e2 el 

5 = min < 1 , ^ ^ ' 6 ) ' (Definition of 6) 

Let a = a{6), b = b{5) and t' = t'{S) as guaranteed by the definition of (c, A, 0)-median. Choose A so 

r a 81n(2)l 
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For technical reasons, it will be convenient to fix constants a and f5 such that 



\ < a < (3 < e-r]. (11) 

We shall also encounter two constants Ki and K2, depending solely on d. Since g = 0{t^), there are 
constants Cg > 1 and tg > t' such that for all t > tg it holds that g{t) < Cgt^, and 



max 



[^At^.e] < e < < T^t^-\ (12) 
L J baCg 



2t^ 2aKi ^ 
c - 5cK2h - ln(t)- ^ ' 

Consider now t > tg and the positive integers ni , 722 , . . . , n^^ with geometric mean N, summing up to S, and 
satisfying both the size constraint condition (A^ > At^) and balance condition (S < g{t)N). Furthermore, 
define M = cNt"'^ and choose H according to 'D{Kni^...^ni)- In the following two sections, we separately 
consider the case where is less than and at least . 



3.2 Upper tail bound for not to large values of 

Throughout this section, we assume N < t^. We will show that H satisfies the size lower bound restriction 
in the definition of (c, A, 6*) -median. The fact that V admits a concentration constant h will allow us obtain 
a bound on the upper tail of L{H). 



Let t > tg. Since N satisfies both the size constraint and balance condition, by ( |T0| ), ( [12] ), and the 
definition of 5, 



at^ 

N > At^ > — > at^ , 


Sb < g{t)bN < Cgbf^^^ < ^ < 



Thus, ni, . . . ,nd satisfy both the size lower and upper bound conditions of the definition of (c, A, 0) -median. 
Hence, if H is chosen according to then every median Med of L{H) is 6M close to M. Simple 

algebra, the definitions of concentration constant and (c, A, 6')-median, and given that by definition of 6 we 
know that 5 < e, we have 



Pr[L{H)>{l + e)M] = Pr 



Z,m > I 1 + <l±4»^;^ I Med 
^ ^ * Med ' 



((1 + 6)M-Med)2^ ^ _A A^-^r 



< 2e.,^-h - J < exp^ln(2)-/.^M 

By ( fTO] ), since N > At^, the fact that by definition of S we know that 6 < e^/(l + e), and recalling that 

M = cNt-^, 

Pr[L{H)>{l + e)M] < " 

< exp r -M < exp -M . 

^\ 8t^ 4(l + e) ; - 8(1 + e) J 

We have thus established (|4| for < t^. 
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3.3 Upper tail bound for large values of 

We now consider the case where N > t^. The magnitude of N is such that we can not directly apply 
the definition of (c, A, 6')-median to a hyper-graph generated according to ... and thus derive the 

sought after exponentially small tail bound. We again resort to the block partitioning technique introduced 
in the proof of the lower bound. However, both the block partitioning and the analysis are more delicate and 
involved in the case of the upper bound. 



3.3.1 Block partition 

Let / = t", L = Cgt'i+°' and 

\il + e)M]. (14) 

In what follows, we shall upper bound the probability that H chosen according to ... has a non- 

crossing hyper-matching of size at least rrimax, i-C- the probability that L{H) > rrimax- 

We begin with a simple observation; since distinct edges of a non-crossing hyper-matching of H can 
not have vertices in common, L{H) < ni for all i. It immediately follows that L{H) is upper bounded by 
the geometric mean of the n^'s, i.e. L{H) < N. Thus, if rrimax > N, then Pr [L{H) > rumax ] = 0. This 
justifies why, in the ensuing discussion, we assume that m^ax 

< N. 

Let J be a non-crossing hyper-subgraph of Kni,...,na such that the number of edges of J is (exactly 
equal) rrimax- We shall partition the edge set of J into consecutive sets of edges to which we will refer as 
blocks. The partition will be such that for any color class, the set of vertices appearing in a block are "not to 
far apart", the precise meaning being clarified shortly. The maximum number of edges in any block will be 
Smax, where: 



I 

^max 



N 



(15) 



Given two edges e and e of J, such that e ^ e, we denote by [e,e] the collection of edges f of J such 
that e < f < e. We now define a partition into blocks of the edge set of J, denoted V{J), as follows: 
V{J) = { [cj, ij] I 1 < i < g } where the e^'s, the e^'s, and q are determined through the following process: 

• ei is the first (smallest according to <) edge of J. 

• Assuming ei = {v'f^ , t>2*^ , • • • , vf* ) has already been defined, ej = , -Ug*'* , • • • , v^^^ ) is the last edge 



of J satisfying the following two conditions (see Figure 3.3.1 for an illustration): 

- [ci, Ci] has at most Smax elements. 

~(i) (i) 

- Vj — Vj < L for all 1 < j < (i (where we have relied on the abuse of notation entailed by our 
identification of the j-th color class of -ftrni,...,nrf with the set {1, 2, . . . , rij} endowed with the 
natural order). 

• Assuming Cj has already been defined and provided there are edges e of J strictly larger than ei, we 
define Cj+i to be the smallest such e. 

Clearly, the value taken by q above depends on J. Nevertheless, we will show that the following estimate of 
q = |'P(J)| holds for all J non-crossing hyper- subgraphs of Kni,...,n/- 

y < 1^(^)1 < (Estimate of g) 



14 



Figure 5: Partition into blocks of a hyper-graph. Each block [cj, Cj] (shown in light grey) contains at most 
Smax edges and at most L vertices from each color class. 



Note that each block has at most Smax edges and recall that J) I = i^^max- Thus, (7 > rrimax/smax > N/l. 
Now, say a block is short if it is either [cq, Cq] or a block with exactly Smax edges. Let Iq be the collection 
of indices of short blocks. It follows that rrimax > (l-^ol — ^)smax- However, since rrimax > M = cA^t^^, 
we know that 

rrimax ^ iV 1 ^ iV 1 

Smax ~ I ' I- N/{lmmax) ~ I ' I- t^-"^ j C ' 

We thus have, since ([T3]l implies that t^^'^ < c/2, that |/o| < 2N/1. 

Say a block is regular if it is not short, and let Ii = [q] \ Iq be the set of indices of such blocks. We shall 
call block cover the collection of all nodes between the first edge of the block (inclusive) and the first edge 
of the next block (exclusive). By definition of block partition, if the i-th block is regular, then for some color 

class j, we must have vf^^'^ — Vj^'^ > L. Hence, Yl'j=ii'^j^~^^^ ~ '^f^) > ^- other words, a regular block 
gives rise to a block cover of cardinality at least L. Since every node belongs to at most one block cover, 

< S/ L. Recalling that L = Cgf /I and that S satisfies the balance condition (hence, S < Cgf^N for 
t > tg), we conclude that < N/l. 

Putting together the conclusions reached in the last two paragraphs, we see that q = |/o| + < 3A^//, 
which establishes the claimed estimate of q. 



3.3.2 Partition types 

Let Si be the number of edges of J in the i-th block [cj, Cj] of the partition V{J). Let q be the number of 
blocks of V{J). We refer to the (3(7)-tuple T = (ei, ei, si, . . . , Cg, e^, Sq) as the type of partition V{J), and 
denote it T{V{J)). Furthermore, let The the collection of all possible types of partitions of hyper-subgraphs 
of Kni,...,nd with exactly mmax edges. 

( N \ 

Lemma 13 There is a constant Ki, depending only on d, such that \T\ < exp I Ki — ln(Z) j . 

Proof: Observe that each is completely determined by specifying its vertices. Hence, the number of ways 
of choosing ei, . . . , is at most the number of ways of choosing q elements from each of the node color 
classes, i.e. at most HiLi C^')- "^^^ number of choices for ei, . . . , is bounded by the same amount. On 
the other hand, since J has exactly mmax edges, the number of choices for si, . . . , Sg is at most the number 
of ways of summing up to rrimax with q positive integer summands. Since we are assuming that rrimax < ^ 
(see comment in this section's second paragraph), we have that the aforementioned quantity can be bounded 
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by (^) . Using that (^) < (ea/b)^ we obtain, for fixed q, that the number of types is bounded by 

no)" ^ (T)-(ff-'-) 



Recalling our estimate for q, we get that 



q=\Nm 

Since ln(x) < x for all x > and by ([T2]l we know that I = > e, 



/3N\ 3N 3N N 

In in < Inf +(l + 2d)^(l + ln(/)) < (2 + 2d)^(l + ln(0) < 12(1 + ln(/) . 

The desired conclusion follows choosing Ki = 12(1 + d). ■ 
3.3.3 Probability of a block partition occurring 

The purpose of this section is to show that for a given fixed type T, with exponentially small in M probability 
a hyper-graph chosen according to 'D{Kni,...,nd) contains a hyper-subgraph of type T with nimax edges. 
Specifically, we will prove the following result. 

Lemma 14 For T £ T, let Pt denote the probability that a hyper-subgraph randomly chosen according 
to 'D{Kni,...,na) Contains a non-crossing hyper-subgraph J with rrimax edges such that T{V{J)) = T. 
Then, for some absolute constant K2 > 0, 

Pt < exp (-K2h,^^M 



We now proceed with the proof of the preceding result. Let T = (ei, ei, si, . . . , eg, Cg, Sg). As before, for 

all i, let Ci = {vi\v2 \ • • • , Vg^^) and Cj = {v^\v2 \ ■ ■ ■ ,Vq^)- Let ffbe chosen according to V{Kni,...,na)' 
and let Hi be the hyper-subgraph of H induced by the nodes between Cj and Cj, i.e., 

(j) (i) , , (i) (i) , . (i) (i) , . ~(j) 

Note that Hi is distributed according to 'D{K (i) (i) (i)), where n^^^ = v\^'^ — v\^^ + 1 is the size of the 

j-th color class of H^. Moreover, if there is a hyper-subgraph J of H such that T{J) = T, then it must hold 
that L{Hi) > Si, for alH = 1, . . . , g. Since by hypothesis, V satisfies the block independence property, the 
events L{Hi) > Si, i = 1, . . . ,q, are independent, so 

(i) U)) ) > -Si 



i=l 



(i) (i) 

Now, let Ni and Si denote the geometric mean and sum of n\ , ... ,ny , respectively. The i-th term in the 
product of the last displayed equation will be small provided the sizes of the color classes of Hi, i.e. the 
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nj*'''s, satisfy the size constraints of the definition of a (c, A, 0)-median. Unfortunately, this may not occur 
for every i, somewhat complicating the analysis. Below we see how to handle this situation. 

Since T{V{J)) = T, we know that n(\n'^\ ■ ■ ■ < L. Recalling that a < /3 and applying ([13]) 
we conclude that Sib < dbL = Cgdbt^'^"' < Cgdbt'^'^^ < t^, so the size upper bound condition of the 
definition of a (c, A, 0)-median holds. However, the same might not be true regarding the size lower bound 
condition Ni > at^. In order to handle this situation, we artificially augment the size of the blocks where 
the condition fails. Specifically, for alH = 1, . . . , g and j = 1, . . . , d we define: 



n 



{i) 



max ^6njAt^/N,nf^ . 



As usual, let Ni and Si denote the geometric mean and sum of the rij s. Now observe that when we 
augment the sizes of the color classes of the hyper-graphs chosen, by the monotonicity property of random 
hyper-graph models, the probability of finding a non-crossing hyper-subgraph of size at least Si increases. 
Hence, 



Pt < 



i=l 



L[V[K n 



(1) (d) 

n- ,...,n- 



> Si 



< 



L[V[ Kn 



-(1) -(d) 

n; ,...,n- 



> Si 



i=l 



We claim that the A^j's and Si's satisfy the size conditions in the definition of a (c, A, 0)-median. Indeed, by 
sin 



definition of of n^^'^ , since n^*^ < L, and 



6nj 



6L < L. 



it follows that Uj < L, and thence, as before augmenting the block sizes. Sib < t . On the other hand, by 
definition of n^*\ given that A^ > At^, and since by ([TO]) we know that A> a/5, 



i/d 



N,, 



n 



(0 



> 




6At^ > at^. 



This concludes the proof of the stated claim. 

Now, let Medj be a median of L{V{K_(i) ^(d)))- By definition of (c, A, 0)-median, 



(1 - 6)cNir^ < Medi < (1 + 6)cNit 



-A 



Hence, for all i such that Sj > (1 + 6)cNit " > Medj, and using that /i is a concentration constant for the 
random model V, we get 



L V Kn 



Pr 



Since Sj < Smax for all i, 



-(1) -(d) 

n- ,....n- 



> Si 



< 2 exp 

< 2 exp 



-h 



-h 



{St 



{s,-il + 5)cNit-^)^ 



Pr 



L V Kn 



-(1) -(d) 

n. ,...,n- 



> Si 



(max {0, Si - {l + 6)cN,t~^})'^ 



< 2 exp 

< 2 exp 



(max {0, Si - {l + 6)cNit~^})'^ 
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Combining some of the previously derived bounds 



IiiPt > -In 



vt=l 



h 2 
> -q\n{2) + ^(ina.xh,Si-{l+6)cN,t-^]) . 



We now focus on the summation in the last term in the preceding displayed equation. We lower bound it, 
via the following generalization of Holder's Inequality. 

Lemma 15 [Generalization of Holder's Inequality] For any collection of positive real numbers Xij, 1 < 
i < Q, ^ < j < d, 



(<? d 
i=i j=i 



< 



d q 

EE 

j=i i=i 



'1,] 



Setting Xij = (n^*^)^/'^ in the aforementioned stated inequality, observing that by definition of nj"' we have 
^j*^ < n^j^ + SrijAt^ /N, and recalling that the sum of n^*^ , • • • , n^*^ is at most nj, 

l/d / \ l/d 

i=l 



d q 

, j,=i ,-1 



d q 

- I n E^^^f + ^^3^^^/^) I < ^^(1 + 5qAt^/N) 
, j,-=l i=i 



Because of our estimate for q and ( [T2| ), we conclude that 

q 



< N{l + 2>5At^-'^) < N{l + 5/3) < N{l + 6). 



i=l 



By Cauchy-Schwartz's inequality and recalling that the sum of the Sj's is exactly equal to rrimax = [(1 + 
6)M1, 



. q^ {max {0, Si- {l + 6)cN it- > ^ max |o, Si - (1 + (5)ciVit-^| 
\ 1=1 1=1 

q 

> mmax-il + S)ct~^Y.^^ ^ M{l + e)-M{l + 6f. 



i=l 



Lets now see that the just derived lower bound is actually positive. Recall, that by definition of 5 we know 
that S < e/6 and 5 < 1, so 



(1 + e) - (1 + (5)2 = e-25-5^ > e - 35 > e/2. 



We then have. 



q^{max{0,Si-il + 6)cNit->^})^ > 
\ i=i 



eM 
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Putting things together, and since Smax < + c)-^' we find that 



In Pt > -gln(2) + -^^(max|o,Si-(l + (5)ciVif"^|)' 

1=1 

h e^M^ , , , hNe^M 



q 



> -gln(2) + — > -gln(2) + 



Finally, recall that by our estimate for q we know that q < 3N/1 and by (12) we have that I = t"' > 9At^, 
so 

iVln(2) he^M cN f he"^ ln(2)\ 

- In Pt > tV + ^ = ^ — } ^ • 

3At^ 12(1 + e \12{l + e) 3Ac J 



By ( lOl we know that ^4 > 8 ln{2)/ {hc6), by definition of 6 we have that 6 < e / (1 + e), implying that 



cN / he^ h6\ e2 hM 

In Pt > ^ — > 

- V12 1 + e) 24/ - 1 + e 24 



We have thus shown that Lemma 14 holds taking K2 = 1/24. 



3.3.4 Upper tail bound 

We are now ready to finally prove ^ior: N >t^ . First, note that 

Pr[L(P(K„,,.,„„J) >m^„^] < ^Pt < |r| • max Pt. 



By Lemmas 13 and 14 the fact that I = t", by our choice of tg so ( [13] ) would hold, recalling that by 
definition of 5 we have that 5 < /{\ + e) and given that M = cNt~'^, 

< explKijln{l)-K2h- Mj = exp lKia—ln{t) - K2h- M 

/6K2hcN , e2 ^ \ / i^a/ie^ 

< exp — ^ - K2h M < exp ^ -M 

- ^\ 2 ' l + e J - 2(l + e) 

We thus conclude that Q holds for any constant K < K2/2 (since K2 = 1/24, any K < 1/48 would do). 
3.4 Upper bounds for the mean and median 

We will now establish the two remaining unproved bounds claimed in the Main Theorem, i.e. ([T]l and ([2]). 



Fix e = eo > and choose 6, A, a, /3, Cg, tg, Ki and K2 as in Section 3. 1 We can view (5 as a function 
of e, henceforth denoted (5(e). Similarly, we can view A and tg as functions of 5, denoted A{5) and tg{5) 
respectively. Let A' be a sufficiently large constant so 

ex-p(-Kh— — ^° , ^ cA^ < — , and < ^cvl' . (Definition of A') 



4(l + eo/2) y - 28' 6i^/i " 4 

Also, let 6q = (5(eo/2). Observe that by definition of 5, for every e > 6 we have that (5(e) = 1. Define now 

A = max{^((5o), A(l),yl'}, andtg = max{tg((5o), tg(l)}. 
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Let t > tg and consider the positive integers ni, . . . , with geometric mean N and summing S satis- 
fying the size and balance conditions in the statement of the Main Theorem, i.e. 

N > At^ , and Sb < g{t)N . 

The choice of tg and A guarantee that (|4]) holds for e = eo/2 and for all e > 6. 

As usual, let H be chosen according to V{Knj,...,na) let M = cNt^^. Let 1^(3;) denote the 
function that takes the value 1 if x G A and otherwise. Observe that 

E[L{H)\ = E[L(i/)l[o,(i+,„/2)M)(^(^))] 

+ E[L(if)l[(i+,,^/2)MJM)(^(^))]+E[L(i7)l[7M,+oo](^W)] . 

Lets now upper bound separately each of the terms in the right hand side of the preceding displayed equation. 
The first one is trivially upper bounded by (1 + eo/2)M. Thanks to (|4]), since N > At''^ > A't^, and by 
definition of A', 

E[L(//)l[(i+,„/2)M,7M)(^(^))] < 7MPr[L(F) > (l + eo/2)M] 

< 7Mexp(-Kh—^^.^) < 7MeKp(-Kh—^^---cA') < 

4(1 + eo/2) tV " V 4(1 + eo/2) 7 " 4 

Now lets consider the third term. By Q, since for e > 6 it holds that e/(l + e) > 6/7, given that 
M = cNt~^ >cA> cA', and by definition of A', 

E[L{H)l[7M,+oo]{L{H))] = / Pr[L{H)>t]dt = M Pr [L(F) > (1 + e)M] de 

< M y exp y-Kh^^ -Mjde < M J exp — ■ Mej de 

..f^Kh^\'^ ( 36Kh , \ .,fQKh Meo 
= m(^— Mj exp(^ ^"^^j - ^y-^-^^) < 

Summarizing, we have that E [ L{H) ] < (1 + eo)M which proves ([T]l. 

Finally, we establish Q. Again, let e > and choose 5, A, a, /3, Cg, tg, Ki and K2 as in Section 3.1 

Let 

A' = max {a, ^i^t^llljMl (Definition of A') 

[ Khce^ J 

Now, let t > tg and ni, . . . , be positive integers with geometric mean N and summing up to S satisfying 
the size and balance conditions with respect to the just defined constant A', i.e. 

N > A't^, and Sb < git)N . 

By Q and definition of A', it follows that 

Pr[L{H)>{l + e)M] < exp (^-Kh,^^ ■ < exp (^-Kh,^^ ■ cA'^ < ^. 

Hence, every median of L{H) is at most (1 + e)M, thus establishing (|2]l and completing the proof of the 
Main Theorem. 



20 



4 Applications 



4.1 Random binomial hyper- graph model 

In this section, we show how to apply the Main Theorem to the d-partite random binomial hyper-graph 
model. 

We will show that the constant c of the definition of a (c, A, 0)-median for this model is related to a 
constant that arises in the study of the asymptotic behavior of the length of a longest increasing subsequence 
of d — 1 randomly chosen permutations of [n] , when n goes to infinity. We first recall some known facts 
about this problem. Given a positive integers d and n, consider d permutations of vri, . . . , vr^ of [n]. We 
say that L = { {ij , vri (ij ),..., vr^i (ij ) | 1 < j < ^ } is an increasing sequence of (vri , . . . ,tT(i) oi length £ if 
h < 12 < ■ ■ ■ < H and TTt{ii) < vrt(i2) < . . . < TTt{ie) for 1 < t < d. We denote by lisd+i{n) the random 
variable corresponding to the length of a longest increasing subsequence of (vri, . . . , vr^) when tti , . . . , vr^ are 
randomly and uniformly chosen. The study of the asymptotic characteristics of the distribution of lisd{n) 
will be henceforth referred to as Ulam's problem in d dimensions (note that the d = 2 case corresponds 
precisely to the setting discussed in the first paragraph of the introductory section of this work). 

Ulam's problem in d-dimensions can be restated geometrically. Indeed, consider x{l), . . . , x{n) uni- 
formly and independently chosen in the d-dimensional unit cube [0, 1]"^ endowed with the natural component 
wise partial order. Let Hd{n) be the length of a largest chain C C {x(l), . . . , x(n)}. It is not hard to see 
that Hd{n) and lis din) follow the same distribution. BoUobas and Winkler IIBW88II showed that for every 
d there exists a constant Cd such that Hd{n) / ^fn (and thus also lisd{n) / ^/n) goes to as n — )• oo. Only 
the values ci = 1 and C2 = 2 are known for these constants. However, in |BW88,I it is shown that q < Cj+i 
and Ci < e for all i, and that the lim^^oo Cd = e. 

Now, back to our problem. Our immediate goal is to estimate a median of L{Q{Kn-^^^,,,^na:P))- Con- 
sider H chosen according to G{Kni,...,najP) 1st H' be the hyper- subgraph of H obtained from H after 
removal of all edges incident to nodes of degree at least 2. Let E = E{H) and E' = E{H'). In order to 
approximate a median of L{Q{Kn-i,...,na,p)) it will be useful to estimate first the expected value of L{H'). 
We now come to a crucial observation: L{H') is precisely the length of a largest chain (for the natural 
order among edges) contained in E' , or equivalently the length of a longest increasing subsequence of d — 1 
permutations of {1, . . . , The preceding observation will enable us to build on the known results con- 

cerning Ulam's problem and use them in the analysis of the Longest Non-crossing Matching problem for 
the random binomial hyper-graph model. In particular, the following concentration result due to BoUobas 
and Brightwell IIBB92I for the length of a d-dimensional longest increasing subsequence will be useful for 
our purposes. 

Theorem 16 [BoUobas and Brightwell IIBB92\ Theorem 8]] For every d > 2, there is a constant Dd such 
that for m sufficiently large and 2 < A < m}-!'^'^ j log log m, 



Pr 



,,. . ^ c r,. / Ml ^ A-Ddmi/2'^log(m) 

\lisd{m) - E \lisd\jn)\\ > — , — — 

log log (mj 



< SOA^e" 



We will not directly apply the preceding result. Instead, we rely on the following: 

Corollary 17 For every d > 2, t > and a > 0, there is a mo{t,a,d) sufficiently large such that if 
m > rriQ, then 



Pr 



lisd{m) - Cdm^^^ > tcdrn^^'^ 



< a. 
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Proof: Let be the constant in the statement of Theorem 16 By definition of Ulam's constant, we know 
that liiiiri^oo E [ lisd{m) ] / ^fm = Cd- Hence, we can choose uiq = mo{t, q, d) sufficiently large so that for 
all m > rriQ, Theorem [16] holds and in addition the following conditions are satisfied: 



I E [ lisd{m) ] I - Crfm^/"' < tcdm^^'^/2. 



< 



log log(m) 



and SOA^e-^' 



< a. 



(Both conditions can be satisfied since (log log(m,))^ = o(log(m)) and given that A(m) — )• oo when m 
oo.) It follows that for all m > mo. 



Pr 



lisd{m) - Cdrn^/'^ 



> tcdm 



i/d 



< Pr 

< Pr 

= Pr 



\lisd{m) - E [lisd{m)] \ + E [lisd{m)] - Cdm^^'^ 



\lisd{m) - E [lisd{m)]\ > ^tcd'm^^'^ 



> tcdm 



l/d 



\lisd{m) - E [lisd{m)]\ > 



ADrfmV2rflog( 



m 



log log(m) 



< SOA^e-^' 



For future reference, we recall a well known variant of Chebyshev's inequality. 

Proposition 18 [Chebyshev's inequality for indicator random variables] Let Xi, . . . , be random vari- 
ables taking values in {0, 1} and let X denote Xi + . . . + Xm- Also, let A = ^ • j.^^j E [ XiXj ]. Then, for 
all t>0, 

Pr[\X-E[X]\>t] < 1(E[X](1-E[X]) + A) . 



Moreover, ifXi, . . . , X^ are independent, then 



Pr [|X - E [^11 > i] < 



E[X] 

t2 



Proof: Observe that since Xi is an indicator variable, then E = E [^j]- Thus, if we let V [^] 

denote the variance of X, 



V[X] = E[X2]-(E[X])2 = ^E[X2]+A-(E[X])2 = e [X] (1 - E [X]) + A . 



A direct application of Chebyshev's inequality yields the first bound claimed. The second stated bound, 
follows from the first one and the fact that if Xi, . . . , X^ are independent, then A < (E [X j)^. ■ 

We will also need the following two lemmas. 

Lemma 19 Let N and S denote the geometric mean and sum o/ni, . . . , Ud- If N = (j^'^^^^nj — 1)^ , 
then N'^- N"^ < S"^'^. 
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Proof: Direct application of Lemma 15 



Lemma 20 Let N and S denote the geometric mean and sum o/ni, . . . , rid- If N 
then the following hold: 



E[\E\ 

e[\e'\ 

E [|^\^'| 
Moreover, for all rj > 0, 



= N'^p{l - p)^"-^" > N'^p{l- S'^-^p). 
< N'^S'^-^p^. 



Pr[|^|-E[|^|] >??E[|-E|]] < 



(16) 
(17) 
(18) 

(19) 



ri^E[\E\] ■ 

Proof: Let K = i^ni,...,nfc> and for each e G E{K) let X^. and Yg denote the indicators of the events e ^ E 
and e G E' , respectively. Note that \E\ = X]ee-B(ii') and \E'\ = Yl,e<^E{K) ^e- Clearly, E [^e] = P 
for all e G E{K). Moreover, e G E' if and only if e G E and no edge / G £' \ {e} intersects e. 
Since the number of edges in E{K) that intersect any given e G E{K) is exactly N"^ — N'^, we have that 
E [ y; ] = p{l- p)^"^-^"^. Observing that \E{K)\ = N'^ we obtain ([16]) and the first equaUty in ([l7). On 
the other hand, since (1 — p)™ > 1 — pm and by Lemma [19} we can finish the proof of ( [TT] ) by noting that 



E[\E'\] = N'^p{l-p) 



p) . 



Inequality (18 1 is a consequence of ( [161 ), ( 17 1, and the fact that E' C E, as follows: 

e[|^\^'|] = e[|^|-|^'|] < N'^S'^-^p'^, 
Applying Chebyshev's inequality for independent indicator random variables { | e G E{K) } yields ([W 



We are now ready to exploit the fact, already mentioned, that L{H') equals the length of a longest increasing 
subsequence of d — 1 permutations of {1, . . . , and then apply Corollary 17 in order to estimate its 

value. Formally, we prove the following claim. 

Proposition 21 Let 6 > 0, d > 2, and N and S be the geometric mean and sum of positive integers 
m, . . . , rid, respectively. Moreover, let M = CdNp^^^ where Cd is the d-dimensional Ulam constant. Then, 
there is a constant C = C{6) sufficiently large such that: 

• IfNp^/'^ > C and US'^'^'^p'^^'^^'^ < d'^~^5cd, then every median of L{g{Kn,,...,na,p)) is at most 
{1 + 6)M. 

• IfNp^/'^ > C and US'^^^p < S, then every median of L{Q{Kni,...,ndTP)) at least (1 — 6)M. 

Proof: To prove that every median of L{Q{Kn-i^,,,^najP)) is at most (1 + 6)M, it suffices to show that 
Pr [L{H) > (1 + d)M ] is at most 1/2. To establish the latter, note that L{H) < L{H') + \E\ E'\, hence 

M6' 

~Y 

M6' 



Pr[L{H)>{l+5)M] < Pr 

< Pr 



\E\E'\ > 
\E\E'\ > 



+ Pr [LiH') > {1 + S/2)M] 
+ Pr 



\E'\> {1 + 6/2)^ 



+Pr 



L{H') > (1 + d/2)M, \E'\ < (1 + 6/2) 
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We now separately upper bound each of the latter three terms. For the first one, we rely on Markov's 
inequality, inequality ( [TT] ) of Lemma 20 the fact that N < S/d, and our hypothesis, to conclude that: 

MS' 



Pr 



\E\E'\ > 



2 r|„> wn 27^^ 5'^- V 



- dd-^cd - 6 



To bound the second term, note that \E\ > \E'\, and recall (fT6]) and (fT9]) of Lemma 20 

4 



so 



Pr 



\E'\>{l + 6/2) — 



Pr[\E\>{l + 6/2)E[\E\]] < 



6^E[\E\] 6^N^p 



Since by assumption N'^p > C^, it suffices to take > 24/(5^ in order to derive an upper bound of 1/6 
for the second term. 

Finally, we focus on the third term. Let m = [(1 + 5 /2)M'^ / c'^\ . Recall that conditioned on \E'\ = n', 
the random variable L{H') follows the same distribution as lis{n'). Thus, since n > n' implies that lis{n) 
dominates lis{n'), and given that (1 + x)" < 1 + ax for x > — 1 and < a < 1, 



Pr 



L{H') > (1 + 6/2)M, \E'\ < (1 + 6/2) 



< Pr 



Pr 



< Pr 



lisd{m 



hsd(m) > , ,, Cdm ' 

Setting t = {d— 1)6 / {2d + 6) and requiring that > mo + 1 with rriQ = mo{t, 1/6, d) as in Corollary 
and since by assumption N'^p > C^, we have 

m = [{l + 6/2)M'^/ci\ = [{l + 6/2)N^p\ > [C'^J > mo- 

Thus, we can apply Corollary [TT] and conclude that 



17 



Pr 



L{H') > (1 + 6/2)M, \E'\ < (1 + 6/2)A/r/ci 



< 



6 



In summary, Pr [L{H) > (1 + 6)M] < 3(1/6) = 1/2 as we wanted to show. 

Now, to prove that every median of L{Q{Kni^,,,^na,p)) is at least (1 — 6)M, it suffices to show that 
Pr [L{H) < (1 — 6)M] is at most 1/2. Note that L(-) is non-negative, so we can always assume that 
6 <1. Since L{H') < L{H), 

M'^ ■ 

1^1 <(1-'^)^ 



Pr [L{H) < {1 - 6)M] < Pr 

< Pr 



\E\<il-S)-^ 



+ Pr 
+ Pr 



L{H') < {1-6)M,\E\ > (1-6) 



\E\E'\>{6/2) — 



+Pr 



L{H') < (1 - 6)M, \E'\ > (1 - 6/2) 



As above, we separately bound each of the two latter terms. In the case of the first term, by ( fT6| ) and ( [T9| ) of 



Lemma 20 



Pr 



\E\ < (1-6)- 



Pr[|S| < (1-(5)E[|^|]] < 



1 

¥e\\e\ 



1 
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Since by assumption N'^p > C^, it suffices to take > 6/5^ in order to establish an upper bound of 1/6 
for the term under consideration. 



To bound the second term, simply apply Markov's inequality, use ( [T8| ) of Lemma 20 and recall that by 
assumption 12pS"^^^ < 6 — an upper bound of 1/6 follows for the term under consideration. 

Now, for the third term, let m = \{1 — 6/2)M'^ /cf]. Recall that conditioned on = n', the random 
variable L{H') follows the same distribution as lis{n'). Thus, since n' > n implies that lis{n') dominates 
lis{n), some basic arithmetic and given that (1 + x)" < 1 + ax for a; > — 1 and < a < 1, 



Pr 



L{H') < (1 - 5)M, i^;'! > (1 - V2)^ 



< Pr 

< Pr 



^ - (1 - 5/2)1/"! <i 
Us.im) < (l - ^ (l - ^ 



d - 
l/d 



< Pr [lisdim) <{1-6)M] 



Cdin 



< Pr 

i/d 



lisdim) < (1 - 6/2)^-^/'^Cdm^/'^ 



Setting t = ((5/2)(l - l/d), requiring that C > (mo/(l - 5/2))'^^'' with mo = mo{t,l/6,d) as in Corol- 
lary 17 and since by assumption N'^p > C^, we get 



m > (1-5/2) — 
4l 



{l-5/2)N'^p > {l-6/2)C'^ > mo 



Thus, we can apply Corollary [Tt] and conclude that the third term is also upper bounded by 1/6. 
Summarizing, Pr [L{H) < (1 - 6)M] < 3(1/6) = 1/2 as we wanted to show. ■ 



Corollary 22 Let d > 2. If t = 1/p, then the model {Q(Kni,...,najP)) of internal parameter t admits a 
(c, A, 9)-median where 

( \ a\ ( 1 2d-l 



d' 2d{d-l) 

Proof: As usual, let N and S denote the geometric mean and sum of ni , . . . , Ud- Let H be chosen according 



to g{Kn^^...,na,P)^ M = cN/t^ = CdNp^/'^, 5 > 0, and C{5) be as in Proposition |2l[ Define a{6) = C{5), 
b{5) = (12/((5(i'^-icrf))i/(2"'-2) and t'{5) sufficiently large so t > t'{6) and t^-i/M < {d/i2)t{b{6))'^-^ 
Note that if t > t'{6), N > a{6)t^/'^, and Sb{6) < ti'^d-i) / {2d{d-i)) ^ ^.^^^ jj^g hypothesis of Proposition 
will be satisfied, and thence every median of L{H) will be between (1 — 5)M and (1 + 5)M. ■ 
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Recalling that by Proposition 10 we know that h = 1/4 is a concentration constant for the d-dimensional 



binomial random hyper-graph model, by Corollary [22] and the Main Theorem, we obtain the following: 

Theorem 23 Let e > and 5 : R — > R fte such that g{t) = 0{t^)for a given < t] < l/{2d{d — 1)). Fix 
ni, . . . ,nd and let N and S denote their geometric mean and sum, respectively. There exists a sufficiently 
small po and sufficiently large A such that ifp < po, Np^^^ > A and S < g{l/p)N, then for M = CdNp^/'^ 
where Cd is the d-dimensional Ulam constant, 

(l-e)M < E[L{g{Kn,,...,n„p))] < (l + e)M, 

and the following hold: 
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• If Med [L{g{Kni,...,na^P))] is a median of L{Q{Knj^^...,na^P))' 

(l-e)M < Med [L{g{Kn„...,n„p))] < (l + e)M. 

• There is an absolute constant C > such that 

Pr [L{G{Kn,,...,n,.V)) < (1 - e)M] < exp {-Ce^M) , 
Pr[L{g{Kn„...,n,,p))>{l + e)M] < exp (^-C^m) . 

We are now ready to prove Theorem[6]which is this section's main result, and was already stated in the main 
contributions section. 

Proof of Theorem |6| Let n,n' , n" be positive integers such that n = n' + n" . Clearly, 



L{Q{Kl^\p)) 



> E 



L{g{Kl^\p)) 



By subadditivity, it follows that the limit of E 



+ E 



5p = inf„gN E 



L{g{Ki''\p))/n 



A direct application of Theorem 



when normalized by n exists and equals 
yields that 6p/ ^ — )• q when 
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4.2 Random word model 

In this section, we consider the random d-word model. The structure, arguments and type of derived results 
are similar to those obtained in the preceding section. However, the intermediate calculations are somewhat 
longer and more involved. We omit the proofs of this section's results from the current draft. 

As in the preceding section, we first show that the random model under consideration admits a (c, A, &)- 
median. Now consider H chosen according to ... A;) and let H' be the hyper-subgraph of H 

obtained from H as in the preceding section (i.e. by removal of all edges incident to nodes of degree at 



least 2). Let E = E{H) and E' = E{H'). For the random word model, the analogue of Lemma 20 is the 
following: 

Lemma 24 Let N and S be the geometric mean and sum of positive integers ni, . . . , rid, respectively. Then, 



^m] = 

^[\E\E'\] < 
Moreover, for all r] > 0, 



> (^ -- 
- kF^ \ ~ k 



k^ 



(20) 
(21) 
(22) 



Pr[|^'|-E[|£;'|] >r?E[|^'|]] < 



+ 



r]'^E[\E'\] rj^ \ \k-2 



k-l 



2d-l 



We can now determine an estimate the median of L(S(i^„^ ... k). 
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Proposition 25 Let 5 > ^, d > 2, and N and S be the geometric mean and sum of positive integers 
ni, . . . ,nd, respectively. Moreover, let M = CdN/k^^^^'^ where Cd is the d-dimensional Ulam constant. 
Then, there are sufficiently large constants C = C{6) and K = K{6) such that: 

• Ifk>K, N > Ck^-^/'^, 125"^ < 5cdk'^~^+^''^, and S <k/2, then any median o/L(i;(K„,,...,„^, k) 
is upper bounded by (1 + 5)M. 

• Ifk>K,N> Cfci-Vrf andS < 5k/2, then every median o/L(S(i^„^^,..^„^, A;) is at least {\ — 5)M. 
Corollary 26 The model h)) of internal parameter k admits a (c, A, 0)-median where 



Recalling that by Proposition 10 we have that h = 1/ (id) is a concentration constant for the random d-word 



model, by the preceding corollary and the Main Theorem, we obtain the following: 

Theorem 27 Let e > and g : H ^ H be such that g{k) = 0{k^) for a given < rj < Fix 
ni, . . . ,nd and let N and S denote their geometric mean and sum, respectively. There exists sufficiently 
large constants ko and A such that if k < ko, Nk^^^^'^ > A and S < g{k)N, then for M = CdN/k^~^^^ 
where Cd is the d-dimensional Ulam constant, 

(l-e)M < E[L{^{Kn„...,n„k))] < (l + e)M, 

and the following hold: 

• If Med A;))] is a median of L{Ti{Kn-^,...,nci^k)), 

(l-e)M < Med[L(S(i^„,„..,„„A;))] < (l + e)M. 

• There is an absolute constant C > such that 

Pr [L(S(K„,,...,„„ k)) < (1 - e)M] < exp f-^e'M^ , 



Pr[L{T.{Kn,,...,n,.k))>{l + e)M] < exp 



d 

' dl + e' 



-M . 



We are now ready to prove Theorem [7] which is this section's main result, and was already stated in the main 
contributions section. 

Proof of Theorem |7| Let n, re', re" be positive integers such that n = n' + re". Clearly, 



> E 



+ E 



By subadditivity, it follows that the limit of E L{T,{kIi \ k)) when normalized by re exists and equals 
L{Ti{K^\ k))/n . A direct application of Theorem 



Ik = inf„eN E 
k ^ CO. ■ 
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yields that k^ ^^^Jk — ^ Q when 
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4.3 Symmetric and anti-symmetric binomial random graph models 



Throughout this section we focus on the study of L(V) when V is either S{Kn.n,p)) or A{K2n,2mP) as 
defined in the introduction to this work. 

First, we study the behavior of L{G) when G is chosen according S{Kn,n,p)- Recall that in this case, 
the collection of events {{x, y), (y, x)} C E{G) are independent, and each one occurs with probability p. 
Also note that (x, y) G E{G) if and only if (y, x) G E{G) — any graph for which this equivalence holds 
will be said to be symmetric, thus motivating the use of the word "symmetric" in naming the random graph 
model. As usual, we begin our study with the determination of the concentration constant for the random 
model under study. 

Lemma 28 The concentration constant for (S (Kn^n, p))ne'N is 1/4. 

Proof: Direct application of Talagrand's inequality (as stated in HJLROOi Theorem 2.29]). ■ 



As in the study of the binomial model (Section 4.1 ) and the word model (Section [4^2)), given a graph 



G chosen according to S{Kn,n,p) we will consider a reduced graph G' obtained from G by removal of all 
edges incident to nodes of degree at least 2. An important observation is that the graph G' thus obtained is 
also symmetric. Since G' is symmetric, the number of vertices of degree 1 in each of the two color classes of 
G' must be even, say 2m. Thus, the arcs between nodes of degree 1 in G' can be thought of as an involution 
of [2m] without fix points. In fact, given that the distribution of G' is invariant under permutation of its 
nodes, the distribution of G' is also invariant under such permutation, and the resulting associated involution 
is distributed as a random involution of [2m] without fix points. We shall see that under proper assumptions 
L{G) and L{G') are essentially equal — thus, L{G) behaves (approximately) like the length of a longest 
increasing subsequence of a randomly chosen involution of [2m] without fix points. This partly explains our 
recollection below of some results about the length of a longest increasing subsequence of randomly chosen 
involutions. 

Let l2m be the distribution of a uniformly chosen involution of [2m] without fix points. Let L{l2m) 
denote the length of the longest increasing subsequence of an involution chosen according to l2m- Bark and 
Rains liBROlJ showed that the expected value of L(X2m) is roughly 2\/2m, for m large. Moreover, Kiwi [, 
Theorem 5] established the following concentration result for L{l2m) (we state the result in a weaker form): 



Theorem 29 For m sufficiently large and every < s < 2\j2m, 



Pr 



|i(2:2,n)-E[L(X2m)]| > s + 32(2m: 



,1/4 



< 4e" 



s2/16e3/2v^ 



Corollary 30 For every < t < 1 and a > there exists a mo = mo{t, a) sufficiently large such that for 
all m > mo, 



Pr 



2m) 



2V2m 



> 2tV2m 



< a. 



Proof: Let mo = mo{t, a) be sufficiently large such that Theorem 29 and the following conditions hold 
for all m > mo: 

• I E [ L{l2m) ] - 2V2m\ + 32(2771)1/"^ < tV2m. 
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It follows that 



Pr 



L{l2m) - 2V2^ 



< Pr 

< Pr 

< 4e' 



> 2tV2m 
E[L{l2m)]\ >2tV2m 



2m 



2V2m 



\L{l2^)-E[L{l2m)]\ >tV2^ + 32{2m) 



1/4 



We now proceed to show that the symmetric random model S{Kn^n,p) admits a (c, A, 6')-median where the 
constant c is related to a constant that arises in the study of the asymptotic behavior of L{X2m)- We will 



need the following analogues of Lemmas 20 and 24 



Lemma 31 Let n be a positive integer. Let G is chosen according to S{Kn^n,p)- If E and E' denote E(G) 
and E{G'), respectively, then 



Moreover, for r] > 0, 



E [ \E\ ] = pn{n — 1) , 
E[\E'\] = pn(n-l)(l-p)2"-^ 
E[\E\E'\] < 2p2n(n-l)(n-2). 



Pr[||^|-E[|^|]| >r/E[|^|]] < 



T^[\E\ 



(23) 
(24) 
(25) 



(26) 



Proof: [Sketch] For i ^ j, let Xij and Yij denote the indicator of the event (i, j) G E and (i, j) G E', 
respectively. Observing that E[Xij] = p, E[Yij] = p{l - pf""'^, \E\ = j^ij-.i^j and \E'\ = 
Yij, yield (|23]) and (|24]). Since E' C E, it follows that \E\ E'\ = \E\ - \E'\. Identity (|26]l follows 
from and (|24]l observing that (1 - > i _ (2n - 4)p. 



To establish ( |26l ) we observe that \E\ can also be expressed as 2 Yl,i<j and that { Xij | i < j } is a 
collection of independent random variables. To conclude, note that 



def 



E . (;)((»)-:) 



(i,3),(k,l):i<3,k<l 
(i,j)^(k,l) 



and apply Chebyshev's inequality for indicator random variables to conclude (26 1 



Proposition 32 Let (5>0, 0<p<l and n be a positive integer There is a sufficiently large constant 
Ci = Ci{5), and sufficiently small constants C2 and C3, such that 

• IfCi/p < n2 < C25/p^/^, then every median of L{S{Kn,n,p)) is at most 2(1 + 5)n^. 

• If Ci/p < v? < C^d'^ /p^, then every median of L{S{Kn,n,p)) is at least 2(1 — 5)n^. 



Proof: Similar to the proof of Proposition 21 
We immediately have the following: 
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11 



12 



G 





Figure 6: An illustration of a graph G in the support of S{Ki2^i2,p) (top), and the graph O in the support 
of 0{Ki2^i2,p) obtained from G by removal of all edges (x, y) such that x > y (bottom). Thicker edges 
represent a non-crossing matching M of G (top), and the associated non-crossing matching of O with 
edge set { (min {x, y} , max {x, y}) \ {x, y) G E{M) } (bottom). 

Corollary 33 The model {S{Kn,n,p))ne'N of internal parameter t = 1/p admits a (2, 1/2, 3/ 4) -median. 
We now define an auxiliary distribution which will be useful for our study: 

• 0{Kn,n,p) (the oriented symmetric binomial random graph model) — the distribution over the set 
of subgraphs H of Kn,n where the events { H \ G E{H) } for 1 < i < j < n, have proba- 
biUty p and are mutually independent, and the events { H \ G E{H) }, 1 < j < i < n, have 
probability 0. 



(See Figure 4.3 for an illustration of the distinction between distributions S{Kn,n,p) and 0{Kn,n,p)-) 



The following result justifies why we can henceforth work either with L{S{Kn^n,p)) or L{0{Kn,n,p))- 

Lemma 34 The random variables L{S{Kn^n,p)) cind L{0{Kn^n,p)) cif^ identically distributed. 

Proof: Let O be a graph in the support of 0{Kn^n,p)- We can associate to O a graph G over the same 
collection of vertices and having edge set { (x,y) | {x,y) G E{0) or (y, x) G E{0) }. Clearly, G is a 
symmetric subgraph of Kn,n and hence it belongs to the support of S{Kn^n,p)- It is easy to see that the 
mapping from O to G is one-to-one. Moreover, the probability of G being chosen under S{Kn,n,p) is 
exactly equal to the probability of occurrence of O under 0{Kn^n:P)- 

On the other hand, if M is a non-crossing subgraph of G, then there is a non-crossing subgraph of O 
(and hence of G), say N, whose size is the same as the one of M. Indeed, it suffices to take as the collection 



of edges of N the set { (min {x, y} , max {x, y}) | (x, y) G E{M) }. (See Figure 4.3 for an illustration of 



the relation between M and A^.) We get that L{G) = L{0), which concludes the proof. ■ 
We are now ready to prove the main result of this section. 

Theorem 35 For every e > there is a sufficiently small constant pQ and a sufficiently large constant A 
such that for all p < po and n > A/^, 

(l-e)2nVP < E[L{SiKn,n,p))] < {I + e)2n^ , (27) 

and the following hold 
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• If Med [L{S{Kn,n,p))] is a median of L{S{Kn^n,p)), 

(l-e)2nVP < Med[L{SiKn,n,p))] < (1 + e)2n^ . (28) 

• There is an absolute constant C > 0, such that 

?r[L{S{Kn,n,p))<{l-e)2n^] < exp (-Ce^n^p) , (29) 
Pr[L{S{Kn,n,p))>{l + e)2n^] < exp (^-C^nVp) • (30) 

Proof: Unfortunately, (5(K„^„,p))neN is not a random hyper-graph model, so we can not immediately 
apply the Main Theorem. However, it is a weak random hyper-graph model. Hence, to prove the lower 
bound in ( |27| ) and ( 28 1, and inequality ( 29 1, we use the fact that the model S{Kn^n , p) with internal parameter 
t = \/p has a concentration constant /i = 1/4 (Lemma 28 1 admits a (2, 1/2, 3/4)-median (Corollary 33 1, 
and apply the Main Theorem. 

To prove the remaining bounds, consider a bipartite graph H chosen according to Q{Kn^n,p), and let O 
be the graph obtained from H by deletion of all its edges (x, y) such that x > y. Since O is a subgraph of H, 



it immediately follows that L{0) < L{H). Note that O follows the distribution 0{Kn,n,p)- By Lemma 34 



L{0) has the same distribution as L{S{Kn,n,p)- Hence, if n and p satisfy the hypothesis of Theorem 23 



E[L{SiKn,n,p))] = E[L{0)] < E[LiH)] < (1 + e)2n^ , 
Med[L{S{Kn,n,p))] = Med[L(0)] < Med[L{H)] < {1 + e)2n^ , 



and provided C is as in Theorem [23| 

Pr[L{S{Kn,n,p))>il + e)2n^] = Pr [ L(0) > (1 + e)2nVp] 
< Pr [L{H) > {l + e)2n^] < exp (^-Cr^^2n^^ . 

This concludes the proof of the stated result. ■ 

We can now establish Theorem [8] 
Proof of Theorem |8| Let n,n' , n" be positive integers such that n = n' + n" . Clearly, 

E[L{S{Kn,n,p))] > E[L{S{K^,,n',p))]+E[L{S{Kr,,>,n",k))] . 

By subadditivity, it follows that the limit of E [L{S{Kn,n,p)) ] when normalized by n exists and equals 
Up = min£t<s E[L{S{Kn,n,p)) /n]. A direct application of Theorem [8] yields that cjp/^ — 2 when 
p^O. U 

One can also show, although not as straightforward as for the case of the symmetric binomial random 
graph model, that the following analogue of Theorem 35 holds for the anti- symmetric case. 

Theorem 36 For every e > there is a sufficiently small constant pq and a sufficiently large constant A 
such that for all p < po and n > A/^Jp, 

(l-e)4n^ < E[LiAiK2n,2n,p))] < (l + e)4n^, (31) 

and the following hold 
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• If Med [L{A{K2n,2n,p))] is amedian of L{A{K2n,2n,p)), 

(l-e)4nVp < Med [LM(K2n,2n,p))] < (1 + e)in^ . (32) 

• There is an absolute constant C > 0, such that 

Pr[L{A{K2n,2n,p))<{l-€)An^] < exp (-Ce^n^p) , (33) 
Pr[L{A{K2n,2n,p)) > (1 + e)4n^] < exp (-C^T^^Vp) • (34) 

Proof: Omitted from current draft. ■ 

Theorem |9] can now be established much in the same way as Theorem [8] was derived. 
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